1 <110> APPLICANT: Avalon Pharmaceuticals

2 <120> TITLE OF INVENTION: Cancer-Linked Genes as Targets for Chemotherapy

3 <130> FILE REFERENCE: 689290-272 

4 <140> CURRENT APPLICATION NUMBER: US/10/585,466

5 <141> CURRENT FILING DATE: 2006-07-05 

6 <150> PRIOR APPLICATION NUMBER: 60/534,419

7 <151> PRIOR FILING DATE: 2004-01-06

8 <160> NUMBER OF SEQ ID NOS: 69

9 <170> SOFTWARE: PatentIn version 3.3

11 <210> SEQ ID NO: 1

12 <211> LENGTH: 1181 

13 <212> TYPE: DNA 

14 <213> ORGANISM: Artificial 

15 <220> FEATURE: 

16 <223> OTHER INFORMATION: cDNA or putative protein derived from a cDNA.

17 <400> SEQUENCE: 1 



18 cacgaggctg gaaggggcca cttcacacct cgggctcggc ataaagcggc cgccggccgc 60 

19 cggcccccag acgcgccgcc gctgccatgg cccagcccct gtgcccgccg ctctccgagt 120 

20 cctggatgct ctctgcggcc tggggcccaa ctcggcggcc gccgccctcc gacaaggact 180 

21 gcggccgctc cctcgtctcg tccccagact catggggcag caccccagcc gacagccccg 240 

22 tggcgagccc cgcgcggcca ggcaccctcc gggacccccg cgccccctcc gtaggtaggc 300

23 gcggcgcgcg cagcagccgc ctgggcagcg ggcagaggca gagcgccagt gagcgggaga 360 

24 aactgcgcat gcgcacgctg gcccgcgccc tgcacgagct gcgccgcttt ctaccgccgt 420 

25 ccgtggcgcc cgcgggccag agcctgacca agatcgagac gctgcgcctg gctatccgct 480 

26 atatcggcca cctgtcggcc gtgctaggcc tcagcgagga gagtctccag cgccggtgcc 540 

27 ggcagcgcgg tgacgcgggg tcccctcggg gctgcccgct gtgccccgac gactgccccg 600 

28 cgcagatgca gacacggacg caggctgagg ggcaggggca ggggcgcggg ctgggcctgg 660 

29 tatccgccgt ccgcgccggg gcgtcctggg gatccccgcc tgcctgcccc ggagcccgag 720 

30 ctgcacccga gccgcgcgac ccgcctgcgc tgttcgccga ggcggcgtgc ccggaagggc 780 

31 aggcgatgga gccaagccca ccgtccccgc tccttccggg cgacgtgctg gctctgttgg 840 

32 agacctggat gcccctctcg cctctggagt ggctgcctga ggagcccaag tgacaaggga 900 

33 caactgacgc cgtctctgtg agcaccgagg ctttttggcc tcagcacctt cgaagtggtt 960 

34 ccttggcaga ctgcctttcc tggaagaggg cacgggcgat cccgacgggg gcattcctgc 1020 

35 gggtgagagc cgtccccacc gcggcggccc ttctcagccc ctccctccat ggagggaccc 1080 

36 atagggctag acactttgag gcaagcagga ggctctgcct aatgtgaatt tatttatttg 1140 

37 tgaataaact gtactggtgt caaaaaaaaa aaaaaaaaaa a 1181 



39 <210> SEQ ID NO: 2 

40 <211> LENGTH: 268 

41 <212> TYPE: PRT 

42 <213> ORGANISM: Artificial 

43 <220> FEATURE: 

44 <223> OTHER INFORMATION: cDNA or putative protein derived from a cDNA. 

45 <400> SEQUENCE: 2 
81 <210> SEQ ID NO: 3

82 <211> LENGTH: 397 

83 <212> TYPE: DNA 

84 <213> ORGANISM: Artificial 

85 <220> FEATURE: 

86 <223> OTHER INFORMATION: cDNA or putative protein derived from a cDNA. 

87 <400> SEQUENCE: 3 



88 tttttttttt tttttttacc tttttacata tttttttatt ttccttcagc aacatcccag 60 

89 atccttccaa gaagagagtt gttgggaggc ctcaggtctg ggcccttctc agctcctggc 120 

90 tctgcctggc tgctctgtgc tctgtgtcct ctcctttctt tcgcttcctc caaacattgc 180 

91 tccttcaatc ccaggagaag tctcctcgga tgtcagcgcc tctaaagcag cccaaggctt 240 

92 gcctcaattg catggtttcc cgagtcctca gctccagaag accaggcaga tgggtggacc 300

93 ggtgagcagc agggcagccc ctgtgcctct gtctctgccg agtcactccg aagcccggca 360 

94 ggcagcgagg aggagggagt ttctccaagg acagaag 397 



96 <210> SEQ ID NO: 4 
97 <211> LENGTH: 1693

98 <212> TYPE: DNA 

99 <213> ORGANISM: Artificial 

100 <220> FEATURE: 

101 <223> OTHER INFORMATION: cDNA or putative protein derived from a cDNA. 

102 <400> SEQUENCE: 4 



103 tatccccatc cttagcaggg aggtgctggt catgtgaccc gatgttgaaa ttgacaagct 60 

104 gctagctagt ccgggccttt tccccccccc tttccttttt tttttttcct cctctcccct 120 

105 ccctcccggc ttcctttctt tgtagccacc tcaggggaag caacagatcg tcactcggtg 180 

106 ttctcaccga aagcacgtaa tcgccggtgt aactcatgtt ggctgggggg cctcccggcg 240 

107 cgcgcggaga ggctggggtg cgcccccatg cagcatgctt gtgctcaatt gcagggtcct 300

108 cgttctcgag tgtgcagagg gcggtgagag ctcaactctc gtccccacct cccacccgca 360 

109 gctccccggg tgggtgaggg atgccctgga ctggggatag ccaggtggga gtccgtcgct 420 

110 gtgtggcctg tggtctcgga gtctgttctc ctggagtctc gcatttgcac ccccttcttc 480 

111 gcagtccccc tcccatagac ttgctctggg aagcgcctct gcctccgacc ctagccggaa 540 

112 ccccttcggg gccagagttt gaagccgtgg atgtgcctgc ctggtggctt gtccgatttg 600 

113 cacggtgact tgattacact ctctcattca tggtcacttc cgaagcgctt tagtgccttc 660 

114 cgtccctaaa ccgcctacag ccagaacggc ttctccccgc ggtttgtcac tgatccgcag 720 

115 ggcccggaag ggccttcgtc ttacccggga tccacctctc ccctcatctt ccctgcctac 780 

116 ctcttcatcc caccttctgt ccttggagaa actccctcct cctcgctgcc tgccgggctt 840 

117 cggagtgact cggcagagac agaggcacag gggctgccct gctgctcacc ggtccaccca 900

118 tctgcctggt cttctggagc tgaggactcg ggaaaccatg caattgaggc aagccttggg 96 0 

119 ctgctttaga ggcgctgaca tccgaggaga cttctcctgg gaggtccaac agccgagctt 1020

120 agcccaccgg gctctgggaa agacccgact gaggctaaag ccgccccgga aggccaagtc 1080 

121 cgagttccat ttcttgaaga ggccggcgcg cgtaaggctg tgacattggc cctggcgact 1140 

122 ggcttcccag gagctgttct ttctcaggag ctccacagcg cgggccatct ccagaaaact 1200 

123 gtcttcagag tgtatttcct tttatcgtca acccagagcc ccaccgcggc taatgcaaga 1260 

124 ggccaaaaaa tgtttggagg aagaaaaaca aaggcaggaa gtggcggcgg cctgacggtg 1320 

125 cgtgtgtgtc tgcagagaag ggagggagcc ggctcagtct cttcttgttt ttccaaactt 1380 

126 caaggtccag gcagccctct gcagggccgg gccccattgc tccccgcgcg gcattggagg 1440 

127 tggccgccgg agaggagaag gccaacgcct gcgccaggct tgtcaggcgg aaacggctaa 1500 

128 caaggagatt tggtcagcaa aacagaccca gcctttccga ggcttcgtct gacttggccc 1560 

129 gaaaggttgg ggaggggggg cttgcgcaga gcctcaggga ccctcctctc tggggactac 1620 

130 catccctgag ccttacgctt ctttccacag cctttgcagg cggaatatcg gaataaagtg 1680 

131 ggtccaggcg cca 1693 



133 <210> SEQ ID NO: 5 

134 <211> LENGTH: 76 

135 <212> TYPE: PRT 

136 <213> ORGANISM: Artificial 

137 <220> FEATURE: 

138 <223> OTHER INFORMATION: cDNA or putative protein derived from a cDNA. 

139 <400> SEQUENCE: 5



140 Met Phe Gly Gly Arg Lys Thr Lys Ala Gly Ser Gly Gly Gly Leu Thr 

141 1 5 10 15

142 Val Arg Val Cys Leu Gln Arg Arg Glu Gly Ala Gly Ser Val Ser Ser

143 20 25 30 

144 Cys Phe Ser Lys Leu Gln Gly Pro Gly Ser Pro Leu Gln Gly Arg Ala

145 35 40 45 

146 Pro Leu Leu Pro Ala Arg His Trp Arg Trp Pro Pro Glu Arg Arg Arg 
151 <210> SEQ ID NO: 6

152 <211> LENGTH: 3611 

153 <212> TYPE: DNA 

154 <213> ORGANISM: Artificial 

155 <220> FEATURE: 

156 <223> OTHER INFORMATION: cDNA or putative protein derived from a cDNA. 

157 <400> SEQUENCE: 6 

158 tatccccatc cttagcaggg aggtgctggt catgtgaccc gatgttgaaa ttgacaagct 60 

159 gctagctagt ccgggccttt tccccccccc tttccttttt tttttttcct cctctcccct 120 

160 ccctcccggc ttcctttctt tgtagccacc tcaggggaag caacagatcg tcactcggtg 180 

161 ttctcaccga aagcacgtaa tcgccggtgt aactcatgtt ggctgggggg cctcccggcg 240 

162 cgcgcggaga ggctggggtg cgcccccatg cagcatgctt gtgctcaatt gcagggtcct 300 

163 cgttctcgag tgtgcagagg gcggtgagag ctcaactctc gtccccacct cccacccgca 360 

164 gctccccggg tgggtgaggg atgccctgga ctggggatag ccaggtggga gtccgtcgct 420 

165 gtgtggcctg tggtctcgga gtctgttctc ctggagtctc gcatttgcac ccccttcttc 480 

166 gcagtccccc tcccatagac ttgctctggg aagcgcctct gcctccgacc ctagccggaa 540 

167 ccccttcggg gccagagttt gaagccgtgg atgtgcctgc ctggtggctt gtccgatttg 600 

168 cacggtgact tgattacact ctctcattca tggtcacttc cgaagcgctt tagtgccttc 660 

169 cgtccctaaa ccgcctacag ccagaacggc ttctccccgc ggtttgtcac tgatccgcag 720 

170 ggcccggaag ggccttcgtc ttacccggga tccacctctc ccctcatctt ccctgcctac 780 

171 ctcttcatcc caccttctgt ccttggagaa actccctcct cctcgctgcc tgccgggctt 840 

172 cggagtgact cggcagagac agaggcacag gggctgccct gctgctcacc ggtccaccca 900 

173 tctgcctggt cttctggagc tgaggactcg ggaaaccatg caattgaggc aagccttggg 960 

174 ctgctttaga ggcgctgaca tccgaggaga cttctcctgg gattgaagga gcaatgtttg 1020 

175 gaggaagcga aagaaaggag aggacacaga gcacagagca gccaggcaga gccaggagct 1080 

176 gagaagggcc cagacctgag gcctcccaac aactctcttc ttggaaggat ctgggatgtt 1140 

177 gctgaaggaa aataaaaaaa tatgtaaaaa gataaccttt tgtttttccc tctccaggaa 1200 

178 atagccaaag ttatttacat atcttgggga gatttagagt ataaactcta agatctttgg 1260 

179 tatttaagtg tcaacatcga tttatttatt tattgctgag ctgactgtaa ctgactcaat 1320 

180 aacaaatcta atcgtgtatt gcactggaaa agaaatattc ttattatgta ttttctccaa 1380 

181 ataatggcct accattgcat ttgaatacct gctgtaaata tcaataatat gaagtaatta 1440 

182 ctctgtagtc gagtaaacta atttattagc attaatgttt atgtggctct ccacctcccc 1500 

183 ccaccaccac ctttacaagg attgcttatc acaaatcaag ctacttggac acattggttt 1560 

184 aatgaactct ttattcaaga tttgctacaa gaattttcat gtcctttgaa tccctgagaa 1620 

185 ctgaacttga aattatttgt gcatcttcag cttgacatat ttgtccactg tggcttgtcc 1680 

186 agagggcagc agtgctgtga gtgagaaggt ccagtgggaa ggaaggttat tggagaagag 1740 

187 tagcctcaga cctcctaaag ctgggaagca catttacaga attgccctgc agcgaaaaaa 1800 

188 cttctatttc cagcagtgac caacaaggca aaatgtttgt ttctccacag catctctttt 1860 

189 ttagagacag aatataaaag acagaaggag aggtttcaaa cagcggtatt cgagctgttc 1920 

190 cctggcttga ttttggctat cccaagctct ctctctgcag cccacccagt ccacgccacc 1980 

191 cctaccttcg aacaaaagga atgcatgaag ggtttcagtg actttgccat aacaaaggcg 2040 

192 ccaccattgc ggggctcgcc ccgcccctgg gtgaaggcaa acaaattctt gcacttgtat 2100 

193 tagggctttt aagaccataa ttgaacccgg gggcgtctag gaaaaccgaa aacagttcta 2160 

194 gacagacctg cggttttata gcagttttgg cagtcaactt cagcttgtgc ctgagcagac 2220 

195 ggggttgtgg tggcccgcca gcgggggatg ccaggccacc tcccccagcg gcacgcagcc 2280 

196 cctctcttaa ttagatcggt tttcccctgg tgtccgggag agcggtcccg gcagaaaggt 2340 
197 cggtatgggg gtgtgcgctg ttccgcataa ccactgcctc ccatgtcctc ctcgagggcc 2400

198 gaaccgagag ggtgctggca gggctggatc ccacgggtgt ccgcaggaga caaaggcgaa 2460 

199 ttccggagaa aaggctgggg ctgagaaagg cgctccggga gcggctggca gggcaattcg 2520 

200 gcaggctgca ccgaagccga gtgcccggag ggacttgccg cccggaaggg ggtgtgtggg 2580 

201 ggcgctgccg tgaagatgga tgagggaaaa ggtttttgat atcagcagaa gggaaaacgc 2640 

202 ctggagtggc cgaacacttt tagttgccca gcaggaatag gagacgggta ctcagctccc 2700 

203 caaggctgcg caatatccca gctttgcccg ctcctgccct cgtgttcgga atatgctggc 2760 

204 ggtgtgaatg tgaaggtttc tccaggcagg cggctggggc gaggggtggg cgcagctctg 2820

205 aggatcagac caaatctagg ggaaaaaggg gaggcagacc ttcgggtcct tggttttgac 2880 

206 tttgctctga gcctatgatt gactcttccg ctttgcagga ggtccaacag ccgagcttag 2940 

207 cccaccgggc tctgggaaag acccgactga ggctaaagcc gccccggaag gccaagtccg 3000 

208 agttccattt cttgaagagg ccggcgcgcg taaggctgtg acattggccc tggcgactgg 3060 

209 cttcccagga gctgttcttt ctcaggagct ccacagcgcg ggccatctcc agaaaactgt 3120 

210 cttcagagtg tatttccttt tatcgtcaac ccagagcccc accgcggcta atgcaagagg 3180 

211 ccaaaaaatg tttggaggaa gaaaaacaaa ggcaggaagt ggcggcggcc tgacggtgcg 3240 

212 tgtgtgtctg cagagaaggg agggagccgg ctcagtctct tcttgttttt ccaaacttca 3300

213 aggtccaggc agccctctgc agggccgggc cccattgctc cccgcgcggc attggaggtg 3360 

214 gccgccggag aggagaaggc caacgcctgc gccaggcttg tcaggcggaa acggctaaca 3420 

215 aggagatttg gtcagcaaaa cagacccagc ctttccgagg cttcgtctga cttggcccga 3480 

216 aaggttgggg agggggggct tgcgcagagc ctcagggacc ctcctctctg gggactacca 3540 

217 tccctgagcc ttacgcttct ttccacagcc tttgcaggcg gaatatcgga ataaagtggg 3600 

218 tccaggcgcc a 3611 



220 <210> SEQ ID NO: 7 

221 <211> LENGTH: 82 

222 <212> TYPE: PRT 

223 <213> ORGANISM: Artificial 

224 <220> FEATURE: 

225 <223> OTHER INFORMATION: cDNA or putative protein derived from a cDNA.

226 <400> SEQUENCE: 7 



227 Met Gly Val Cys Ala Val Pro His Asn His Cys Leu Pro Cys Pro Pro 

228 1 5 10 15 

229 Arg Gly Pro Asn Arg Glu Gly Ala Gly Arg Ala Gly Ser His Gly Cys 

230 20 25 30 

231 Pro Gln Glu Thr Lys Ala Asn Ser Gly Glu Lys Ala Gly Ala Glu Lys

232 35 40 45 

233 Gly Ala Pro Gly Ala Ala Gly Arg Ala Ile Arg Gln Ala Ala Pro Lys

234 50 55 60 

235 Pro Ser Ala Arg Arg Asp Leu Pro Pro Gly Arg Gly Cys Val Gly Ala

236 65 70 75 80 

237 Leu Pro 



239 <210> SEQ ID NO: 8 

240 <211> LENGTH: 3264 

241 <212> TYPE: DNA 

242 <213> ORGANISM: Artificial 

243 <220> FEATURE: 

244 <223> OTHER INFORMATION: cDNA or putative protein derived from a cDNA. 

245 <400> SEQUENCE: 8 

246 tatccccatc cttagcaggg aggtgctggt catgtgaccc gatgttgaaa ttgacaagct 60 

247 gctagctagt ccgggccttt tccccccccc tttccttttt tttttttcct cctctcccct 120 
Please Note:

Use of n and/or Xaa has been detected in the Sequence Listing. Please review the
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#: 20; N Pos. 2, 6, 14, 725, 782, 828
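
The position report above can be reproduced with a short script. The following is a minimal sketch, assuming the sequence text is taken from the listing lines as shown (with leading line numbers and trailing residue counts still attached); the function name `ambiguous_positions` is hypothetical and is not part of any USPTO tool.

```python
def ambiguous_positions(seq_text, symbol="n"):
    """Return 1-based positions of `symbol` in listing-style nucleotide text.

    Digits and whitespace (line numbers, residue counts, group spacing)
    are ignored, so only letters count toward the position.
    """
    residues = [ch for ch in seq_text.lower() if ch.isalpha()]
    return [pos for pos, ch in enumerate(residues, start=1) if ch == symbol]


# Illustrative input only; this is not SEQ ID NO: 20 from the listing.
example = "18 anccgnttac gtacgtacgt 20"
positions = ambiguous_positions(example)
```

Any sequence for which this returns a non-empty list would need an explanation of the n residues in its <220> to <223> fields.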

Invalid <213> Response:

Use of "Artificial" only as "<213> Organism" response is incomplete,

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence.

Seq#: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27
Seq#: 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51
Seq#: 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69
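
The <213> check described above can be sketched in a few lines. This is an illustration only, assuming the listing text uses the `<210> SEQ ID NO:` and `<213> ORGANISM:` field labels exactly as they appear in this file; the function name `invalid_organism_entries` is hypothetical and does not correspond to the USPTO verification software.

```python
import re


def invalid_organism_entries(listing_text):
    """Return SEQ ID numbers whose <213> response is the bare word 'Artificial'."""
    flagged = []
    current_id = None
    for line in listing_text.splitlines():
        # Track which sequence the following fields belong to.
        m = re.search(r"<210>\s*SEQ ID NO:\s*(\d+)", line)
        if m:
            current_id = int(m.group(1))
            continue
        # Capture the organism response, trimmed of trailing whitespace.
        m = re.search(r"<213>\s*ORGANISM:\s*(.*\S)?\s*$", line)
        if m and current_id is not None:
            if (m.group(1) or "").lower() == "artificial":
                flagged.append(current_id)
    return flagged
```

Run against the listing in this file, a check of this kind would be expected to flag every sequence, since each <213> field reads "Artificial" rather than "Artificial Sequence".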
VERIFICATION SUMMARY DATE: 07/21/2006

PATENT APPLICATION: US/10/585,466 TIME: 18:33:55 

Input Set : N:\Crf4\07192006\J585466.raw 
L:773 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:20 after pos.:0
M:341 Repeated in SeqNo=20 